The Annals of Probability 

2007, Vol. 35, No. 5, 1931-1949 

DOI: 10.1214/009117906000001132 

© Institute of Mathematical Statistics, 2007 



ON SUPERCONVERGENCE OF SUMS OF FREE 
RANDOM VARIABLES 

By Vladislav Kargin 

Courant Institute of Mathematical Sciences 

This paper derives sufficient conditions for superconvergence of 
sums of bounded free random variables and provides an estimate for 
the rate of superconvergence. 

1. Introduction. Free probability theory is an interesting generalization 
of classical probability theory to a noncommutative setting. It was intro- 
duced in the mid-1980's by Voiculescu [18, 19, 20] as a tool for studying type 
111 von Neumann algebras. In many respects, free probability theory par- 
allels classical probability theory. There exist analogues of the central limit 
theorem [19], the law of large numbers [6] and the classification of infinitely 
divisible and stable laws [3, 7]. On the other hand, certain features of free 
and classical probability theories differ strikingly. Let Sn = n~^^'^J27=i^ij 
where Xi are identically distributed and free random variables. Then the 
law of Sn approaches the limit law in a completely different manner than 
in the classical case. To illustrate this, suppose that the support of Xi is 
[—1, 1]. Take a positive number a < 1. Then, in the classical case, the prob- 
ability of {\Sn\ > an} is exponentially small, but not zero. In contrast, in 
the noncommutative case, the probability becomes identically zero for all 
sufficiently large n. This mode of convergence is called superconvergence in 
[5]. 

In this paper, we extend the superconvergence result to a more general 
setting of nonidentically distributed variables and estimate the rate of super- 
convergence quantitatively. It turns out, in particular, that the support of 
Sn can deviate from the supporting interval of the limiting law by not more 
than cj^fn and we explicitly estimate the constant c. An example shows 
that the rate n~^/^ in this estimate cannot be improved. 
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Related results have been obtained in the random matrix literature. For 
example, [10] considers the distribution of the largest eigenvalue of an em- 
pirical covariance matrix for a sample of Gaussian vectors. This problem 
can be seen as a problem concerning the edge of the spectrum of a sum 
of n random rank-one operators in the A^-dimensional vector space. More 
precisely, the question concerns sums of the form 5„ = Z^iLi^i^i; where Xi 
is a random A^-vector with entries distributed according to the Gaussian 
law with the normalized variance Then is a matrix- valued random 

variable with the Wishart distribution. 

Johnstone is interested in the asymptotic behavior of the distribution 
of the largest eigenvalue of 5„. The asymptotics are derived under the 
assumptions that both n and approach oo and that limn/iV = 7 > 0, 
7 7^ cxo. Johnstone found that the largest eigenvalue has variance of the or- 
der and that after an appropriate normalization, the distribution of 
the largest eigenvalue approaches the Tracy-Widom law. This law has a 
right-tail asymptotically equivalent to exp[— (2/3)s^/^] and, in particular, 
is unbounded from above. Johnstone's results have generalized the original 
breakthrough results in [16] (see also [17]) for self-adjoint random matrices 
without covariance structure. In [14] and [15], it is shown that the results 
regarding the asymptotic distribution of the largest eigenvalue remain valid 
even if the matrix entries are not necessarily Gaussian. 

An earlier contribution, [2], also considered empirical covariance matri- 
ces of large random vectors that are not necessarily Gaussian and stud- 
ied their largest eigenvalues. Again, both n and A^ approach infinity and 
limn/A^ = 7 > 0, 77^ 00. In contrast to Johnstone, Bai and Silverstein were 
interested in the behavior of the largest eigenvalue along a sequence of in- 
creasing random covariance matrices. Suppose that the support of the limit- 
ing eigenvalue distribution is contained in the interior of a closed interval, /. 
Bai and Silverstein showed that the probability that the largest eigenvalue 
lies outside / is zero for all sufficiently large n. 

These results are not directly comparable with ours for several reasons. 
First, in our case, the edge of the spectrum is not random in the classical 
sense and so it does not make sense to talk about its variance. Second, infor- 
mally speaking, we are looking at the limit situation when N = 00, 00. 
Because of this, we use much easier techniques than all of the aforementioned 
papers, as we do not need to handle the interaction of the randomness and 
the passage to the asymptotic limit. Despite these differences, comparison of 
our results with the results of the random matrix literature is stimulating. 
In particular, superconvergence in free probability theory can be thought as 
an analogue of the Bai~Silverstein result. 

The rest of the paper is organized as follows. Section 2 provides the neces- 
sary background about free probability theory and describes the main result. 
Section 3 recalls some results that we will need in the proof and Section 4 
is devoted to the proof of the main result. 
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2. Main theorem. 

Definition 1. A noncommutative probability space is a pair (A,E), 
where ^ is a unital C*-algebra of bounded hnear operators acting on a com- 
plex separable Hilbert space and is a linear functional from A to complex 
numbers. The operators belonging to the algebra A are called noncommu- 
tative random variables or simply random variables and the functional E is 
called the expectation. 

An algebra of bounded linear operators is a unital C* -algebra if it contains 
the identity operator / and if it is closed with respect to the *-operation, 
that is, if ^ G then A* G A, where A* is the adjoint of operator A. The 
algebra is also assumed to be closed with respect to convergence in the 
operator norm. The definition of noncommutative random variables can be 
generalized to include unbounded linear operators affiliated with algebra A; 
for details, see [4, 11]. In this paper, we restrict our attention to bounded 
random variables. 

The linear functional E is assumed to satisfy the following properties (in 
addition to linearity): (i) E{I) = 1; (ii) E{A*) = E{A); (ih) E{AA*) > 0; 
(iv) E{AB) = E{BA); (v) E{AA*) = implies ^ = 0; and (vi) if An A, 
then E{An) ^ E{A). 

For each self-adjoint operator A, the expectation induces a continuous 
linear functional on the space of continuous functions, Ea '■ f — > Ef{A), and 
by the Riesz theorem, we can write this functional as a Stieltjes' integral 
of / over a measure. We call this measure, n, the measure associated with 
operator A and expectation E. If P{dX) is the spectral resolution associated 
with operator A, then fj,{dX) = E{P{dX)). It is easy to check that /i is a 
probability measure on M. If ^ is a bounded operator and ||A|| < L, then 
the support of /x is contained in the circle |A| < L. 

The most important concept in free probability theory is that of free inde- 
pendence of noncommuting random variables. Let a set of r.v.'s Ai, . . . , An 
be given. With each of them, we can associate an algebra Ai, which is gen- 
erated by Ai] that is, it is the closure of all polynomials in variables Ai and 
A*. Let Ai denote an arbitrary element of algebra Ai. 

Definition 2. The algebras Ai,. .. ,An (and variables Ai,.. .,An that 
generate them) are said to be freely independent or free if the following 
condition holds: 



V'{A{l),---iA{m)) =0, 
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In classical probability theory, one of the most important theorems is 
the central limit theorem (CLT). It has an analogue in noncommutative 
probability theory. 

Proposition 1. Let r.v.'s Xi, i = 1, 2, . . . , he self-adjoint and free. As- 
sume that E{Xi) = 0, \\Xi\\ < L and lim„^oo[-E^(^i ) H h E{Xl)]/n = 02. 

Then measures associated with r.v.'s n~^^'^J2i'=i-^i converge in distribution 
to an absolutely continuous measure with density 

= V 4a2 - (x) ■ 

This result was proven in [18] and later generalized in [11] to unbounded 
identically distributed variables that have a finite second moment. Other 
generalizations can be found in [13] and [21]. 

In the classical case, the behavior of large deviations from the CLT is 
described by the Cramer theorem, the Bernstein inequality and their gen- 
eralizations. It turns out that in the noncommutative case, the behavior 
of large deviations is considerably different. The theorem below gives some 
quantitative bounds on how the distribution of a sum of free random vari- 
ables differs from the limiting distribution. 

Let Xn,i, i = 1, . . . , kn, be a double-indexed array of bounded self-adjoint 
random variables. The elements of each row, . . . , X„ are assumed 

to be free, but are not necessarily identically distributed. Their associated 
probability measures are denoted /U„.j, their Cauchy transforms are Gn,i{z), 

(k) 

their A;th moments are • , and so on. We define Sn = Xn,i + • • • + Xn,k„ 

and the probability measure fin as the spectral probability measure of Sn- 

We are interested in the behavior of probability measure fin as n grows. 

We will assume that the first moments of the random variables Xn^i are 

(2) (2) 

zero and that \\Xn,i\\ < Ln,i. Let Vn = a„/i H l"'^n,L' = maxj{L„^j} and 

= Ln^i H h L^^kn- 

Theorem 1. Suppose that limsupn^oo 

TnlvV^ < 2-12. Then for all suf- 
ficiently large n, the support of fin belongs to 

I = - cTn/Vn,2y/v^ -\- cTn/Vn), 

where c> is an absolute constant. 



Remark 1. c = 5 will suffices although it is not the best possible. 

Remark 2. Informally, the assumption that lim sup„^o^ T^/vn^^ < 2^^"^ 
means that there are no large outliers. An example of when the assumption 
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is violated is provided by random variables with variance = n ^ and 
Ln^i = 1. Then Tn = n and Vn^"^ = 1, so that Tn/vf/'^ increases when n grows. 

Remark 3. The assumptions in Theorem 1 are weaker than the as- 
sumptions in Theorem 7 of [5]. In particular, Theorem 1 allows us to draw 
conclusions about random variables with nonuniformly bounded support. 
Consider, for example, random variables Xf^, k = 1, . . . ,n, that are sup- 
ported on intervals [—k^^^,k^^^] and have variances of order k"^^^. Then 

Tn has the order of and u„ has the order of n^/^. Therefore, Tn/vf/"^ 
has the order of and Theorem 1 is applicable. It allows us to con- 

clude that the support of Sn = Xi -|- • • • -|- Xn is contained in the interval 
(-2^ - 2^ + cnV3). 

Example 1 (Identically distributed variables). A particular case of the 
above scheme involves the normalized sums of identically distributed, bounded, 

free r.v.'s Sn = \-£,n)/V^- If ll'^ill < then < Ln,i = Ln = 

Lj^/n. Therefore, T„ = / y/n. If the second moment of is cr^, then the 
second moment of the sum Sn is Vn = cr'^- Applying the theorem, we obtain 
the result that starting with certain n, the support of the distribution of 5"^ 
belongs to {-2a - c{L^ /a^)n-^/'^ ,2a + cilJ' j a'^)n-^l'^). 

Example 2 [Free Poisson). Let the nth row of our scheme have kn = n 
identically distributed random variables Xn^i with the Bernoulli distribution 
that places probability pnA on 1 and qn,i = 1 — Pn,i on 0. (It is easy to nor- 
malize this distribution to have the zero mean by subtracting Pn^i-) Suppose 
that maxiPn^i — > as n — > oo and that 

n 

"^Pn,! ^ A > 
1=1 

as n — > oo. Then Ln,i ~ 1 and j =pn,i{l —Pn,i) so that T„ ~ 7i and X 
as n ^ oo. Therefore, Theorem 1 does not apply. An easy calculation for the 
case Pn,i = X/n shows that superconvergence still holds. This example shows 
that the conditions of the theorem are not necessary for superconvergence 
to hold. 

Example 3 (Identically distributed binomial variables). Let Xi be iden- 
tically distributed with a distribution that attributes positive weights p and 
q to —\fqjp and \fpjq, respectively. Then EXi = and EXf = 1. It is 
not difficult to show that the support of Sn = n~^/^ Z^iLi -^i the interval 
/ = [xi,X2], where 



xi,2 = ±2./l--+^ ^ ^ 
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This example shows that the rate of in Theorem 1 camiot be improved 

without fm'ther restrictions. Note, also, that for p > q, Ln is \/p/q and 
therefore the coefficient preceding n~^/^ is of order Ln- In the general bound, 
the coefficient is Li^/a"^. It is not clear whether it is possible to replace the 
coefficient in the general bound by a term of order L„. 



3. Preliminary results. 



Definition 3. The function 

(1) =rm. 

J —oo ^ i 

is called the Cauchy transform of the probability measure fj,{dt). 



The Cauchy transform encodes a wealth of information about the under- 
lying probability measure. For our purposes, we need only some of them. 
First, the following inversion formula holds. 



Proposition 2 (The Stieltjes-Perron inversion formula). For any in- 
terval [a, b] , 

1 f'' 

p[a,b] = — \im— / lmG(x -\- ie) dx, 

elO TT J a 

provided that fj,{a) = fi{b) = 0. 



A proof can be found in [1], pages 124-125. 

We will call a function holomorphic at a point z if it can be represented 
by a convergent power series in a sufficiently small disc with center z. We 
call the function holomorphic in an open domain D if it is holomorphic at 
every point of the domain. Here, D may include {oo}, in which case it is 
a part of the extended complex plane C U {oo} with the topology induced 
by the stereographic projection of the Riemann sphere onto the extended 
complex plane. 

The integral representation (1) shows that the Cauchy transform of every 
probability measure, G{z)^ is a holomorphic function in C"*" = G C| Imz > 
0} and C~ = {z G C| Imz < 0}. If, in addition, the measure is assumed to be 
supported on interval [— then the Cauchy transform is holomorphic in 
the area : |z| > L where it can be represented by a convergent power series 
in z~^ , 



(2) 



^/ s 1 rn-\ mo 
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Here, trik denote the moments of the measure ii: 



In particular, G{z) is holomorphic at {00} . We call series (2) the G-series. 
In the other direction, we have the following result. 

Lemma 1. Suppose that: 

(1) G{z) is the Cauchy transform of a compactly supported probability 
distribution, jjL, and 

(2) G{z) is holomorphic at every z G M, \z\ > L. 

Then the support of ^ lies entirely in the interval [— L,L]. 

Proof. From assumption (1), we infer that in some neighborhood of 
infinity, G{z) can be represented by the convergent power series (2) and 
that G{z) is also holomorphic everywhere in C+ and G~ . Therefore, using 
assumption (2), we can conclude that G{z) is holomorphic everywhere in 
the area = {z||2;| > L} including the point at infinity. 

Let us detail the proof of this statement. Define 



and suppose, by seeking a contradiction, that a > L. Let e be such that 
a — £ > L. Consider the area 0^ = {z\a — e < |z| < a + e}. Since G{z) is 
holomorphic everywhere in C"*" and G~ , it is holomorphic in 0^ \ M. In 
addition, by assumption (2), G{z) is holomorphic at each point of fig nM. 
Therefore, it is holomorphic everywhere in fig and thus it is holomorphic 
everywhere in U {2;||z| > a + e/2} = > a — e}. This contradicts the 

definition of a. Therefore, a< L and G{z) is holomorphic everywhere in the 
area = {2||2| > L}, including the point at infinity. 

It follows that the power series (2) converges everywhere in the area = 
{z\\z\ > L}. Since this power series has real coefficients, we can conclude that 
G{z) is real for z £ R, \z\ > L. Also, since G{z) is holomorphic, and therefore 
continuous, m\z\ > L, we can conclude that lim^^QlmG{z + i£) = 0. Then the 
Stieltjes inversion formula implies fi{[a,b]) =0 for each [a,6]c{xGR||3;|> 
L} provided that /u(a) = and fi{b) = 0. It remains to prove that this implies 
fi{\x\ >L} = 0. 

For this purpose, note that the set of points x G R for which /u(x) > is 
at most countable. Indeed, let S be the set of all x for which //(x) > 0. We 
can divide this set into a countable collection of disjoint subsets Sk, where 
k are all positive integers and Sk = {x\k~^ > fi{x} > {k + Clearly, 
every Sk is either empty or a finite set. Otherwise, we could take an infinite 
countable sequence of Xj ^ G Sk and would get (by countable additivity and 




a = mi{l > 0\G{z) is holomorphic on |z| > 1} 



8 



V. KARGIN 



monotonicity of /i) that fJ.{Sk) > J2if^i^i,k) = +00. By monotonicity of /i, we 
would further get /u(M) = +00, which would contradict the assumption that 
/i is a probability measure. Therefore, /S is a countable union of finite sets 
Sk and hence countable. 

Prom the countability of S, we conclude that the set of points x for which 
fj,{x) = (i.e., S^) is dense in the set |x| > L. Indeed, take an arbitrary 
nonempty interval {a, (3). Then (a,/3) n S"^ / since, otherwise, (a,/3) C S 
and therefore S would be uncountable. Hence, S"^ is dense. Using the dense- 
ness of S^, we can cover the set {|x| > L} by a countable union of disjoint 
intervals [a,b], where /i(a) = and fi{b) = 0. For each of these intervals, 
n{[a, 6]) = and therefore countable additivity implies that > L}) = 0. 

Consequently, fi is supported on a set that lies entirely in [—L,L]. □ 

Definition 4. The inverse of the G-series (2) (in the sense of formal 
series) always exists and is called the K-series, 

G{K{z)) = K{G{z)) = z. 

In case of a bounded self-adjoint random variable A, the G-series are 
convergent for \z\ > \\A\\ and the limit coincides with the Cauchy transform 
G{z). As a consequence, the iT-series is convergent in a sufficiently small 
punctured neighborhood of 0. We will call the limit K[z). This function has 
a pole of order 1 at 0. 

It is sometimes useful to know how functions G{z) and K{z) behave under 
a rescaling of the random variable. 

Lemma 2. (i) GaA{z) = a^^GA{z/a) and (ii) KaA{u) = oiKa{oiu). 

The claim of the lemma follows directly from definitions. 
The importance of i^T-functions is that they allow us to compute the 
distribution of the sum of free random variables. 

Proposition 3 (Voiculescu's addition formula). Suppose that self-adjoint 
r.v.'s A and B are free. Let Ka, Kb and Ka+b be the K-series for variables 
A, B and A + B, respectively. Then 

Ka+b{u) = Ka(u) + Kb{u)--, 

u 

where the equality holds in the sense of formal power series. 

The proof can be found in [19]. Using this property, we can compute the 
distribution of the sum of free r.v.'s as follows. Given two r.v.'s, A and B, 
compute their G-series. Invert them to obtain the ET-series. Use Proposition 
3 to compute Ka+b and invert it to obtain Ga+b- Use the Stieltjes inversion 
formula to compute the measure corresponding to this G-series. This is the 
probability measure corresponding to A + B. 



ON SUPERCONVERGENCE OF SUMS OF FREE RANDOM VARIABLES 9 



4. Proof of Theorem 1. The key ideas of the proof are as follows. 

(1) We know that the Cauchy transform of the sum Sn is the Cauchy 
transform of a bounded r.v. (since, by assumption, each Xn.i is bounded). 
Consequently, the Cauchy transform of Sn is holomorphic in a certain circle 
around infinity (i.e., in the area |z| > ii for some i? > 0). We want to estimate 
R and apply Lemma 1 to conclude that Sn is supported on [—R,R]. 

(2) Since the ET- function of Sn, call it Kn{z), is the sum of the K- 
functions of Xn,i and the latter are functional inverses of the Cauchy trans- 
forms of Xn,i, it is an exercise in complex analysis to prove that the K- 
function of Sn takes real values and is a one-to-one function on a sufficiently 
large real interval around zero. Therefore, it has a differentiable functional 
inverse defined on a sufficiently large real interval around infinity (i.e., on 
the set / = (— oo, —A] U [A, oo) for some A which we can explicitly estimate). 
Moreover, with a little bit more effort, we can show that this inverse func- 
tion is well defined and holomorphic in an open complex neighborhood of 
/. This shows that Lemma 1 is applicable, and the estimate for A provides 
the desired estimate for the support of Sn- 

We will begin by finding the radius of convergence of the Taylor series 
of Kn{z). First, we need to prove some preliminary facts about Cauchy 
transforms of Xn^i- 

Define gn,i{z) = Gn,i{z~^). Since the series Gn,i{z) are convergent every- 
where in \z\ > Ln,i, the Taylor series for gn,i{z) converges everywhere in 



2. \Gn,i{z)\ > run^i > everywhere on \z\ = Rn,i] 

3. gn,i{z) has only one zero in \z\ < R']- 

For example, we can take = 2Ln.i and m„^j = (4L„.j)~^. Indeed, for 
any z with \z\ = r > Ln,i, we can estimate Gn,i{z): 



Assume that Rn,i and mn,i are such that: 






1 




r 



r — L, 




'n,i 
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valid for every i and everywhere on \z\ = 2L„^j. 

It remains to show that gn,i{z) has only one zero in \z\ < (2L„^,j)~^. This 
is indeed so because 

and we can estimate 

Therefore, Rouche's theorem is applicable and gn,i has only one zero in 
\z\ < (2L„,,)-i. 

Definition 5. Let i?„ = inaxi{Rn,i} , mn = minj{mn,j} and Dn = 

We are now able to investigate the region of convergence for the series 
Kn,i{z). First, we prove a modification of Lagrange's inversion formula. 

Lemma 3. Suppose w = G{z) (where G is not necessarily a Cauchy 
transform) is holomorphic in a neighborhood of zq = 00 and has the ex- 
pansion 

1 Oi 

- + -2 ' 

Z Z'^ 

converging for all sufficiently large z. Define g{z) = G{l/z). Then the inverse 
of G{z) is well defined in a neighborhood of and its Laurent series at is 
given by the formula 



G{z) = -+ 2 



z = G ^{w) = — + ai-Y^ 

n=l 



2TTin Jd-y z^g{zy 
where ^ is a sufficiently small disc around 0. 

Proof. Let 7 be a closed disc around z = in which g[z) has only one 
zero. This disc exists because g{z) is holomorphic in a neighborhood of 
and has a nonzero derivative at 0. Let 

r^ = i inf \g{z)\. 

Then r^ > 0, by our assumption on 7. We can apply Rouche's theorem and 
conclude that the equation g{z) — w = has only one solution inside 7 if 
<rw. Let us consider a w such that \w\ <rw. Inside 7, the function 

g'jz) 

z{g{z) - w) 
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has a pole at z = 1/G ^{w) with the residue G ^{w) and a pole at z = 
with the residue —1/w. Consequently, we can write: 

g'{z)dz 



1 

27rz Id'y z{g{z) — w) w 



The integral can be rewritten as follows: 

g'iz)dz r g'{z) 



d-yz{g{z)-w) Jdj zg{z)l-w/g{z) 



dz 



E 



g'iz)dz ^„ 



For n = 0, we calculate 



1 



g'{z) dz 



ai. 



2-Ki Jd-y zg{z) 

Indeed, the only pole of the integrand is at z = 0, of order two, and the 
corresponding residue can be computed from the series expansion for g{z): 

g'{z)dz d z^{l + 2aiz + 



res^=o- 



zg{z) dz z{z + aiz"^ + ■ 
d l + 2aizH 



dz \ + aiz + ■ 

For n > 0, we integrate by parts: 

1 f g'{z)dz _ 



z=0 
= Ol. 



^=0 



1 1 



27ri Id-y zg{z)'^~^^ 2Tii n Joj z'^g{z)"- 



dz 



□ 



Lemma 4. The radius of convergence of the K-series for measure fin is 
at least rUn- 

The lemma essentially says that if r.v.'s Xn^i, . . . ,Xn.k„ are all bounded 
by Ln, then the if-series for J2i-^n,i converges in the circle \z\ < 1/(4L„). 

Proof. Let us apply Lemma 3 to Gn,i{z) with 7 having radius {Rn,i)~^ ■ 
By Lemma 3, the coefficients in the series for the inverse of Gn,i{z) are 



dz 



2TTik Jd^ z'^gn,i{zY 



and we can estimate them as 
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This implies that the radius of convergence of the X-series for measure fj,n,i 
is mn,i- Consequently, the radius of convergence of the iC-series for measure 
fin is at least m^. □ 

We can now investigate the behavior of Kn{z) and its derivative inside 
its circle of convergence. 

Lemma 5. For every z in \z\ < rUn, the following inequalities are valid: 



(3) 
(4) 



<Dn\z\\ 



Note that is approximately A:„L^, so the meaning of the lemma is that 
the growth of Kn — z~^ — VnZ around z = is bounded by a constant that 
depends on the norm of the variables Xn,i> • • • ,Xn,k„- 

Proof. Consider the circle of radius mn,i/'^- We can estimate Kn^i inside 
this circle: 



1 



(2) 



< 



Rn.A , \—2\ 1 2 , ^nA / \— 3i 12^1^,1 

-{mn,i) \z\ +^T^{mn,i) ^\z\ 



+ 



Rn'i 



{mn,i) \z 



3| |2 ' '^n,i 

22 



+ 



„ , ,_2, ,2/1 1111 \ 
< Rn,i{mnA)~'^\z\'^ . 

Consequently, using Voiculescu's addition formula, we can estimate 



(5) 



Kn{z) 



■ VnZ 



<Dn\z\ 



and a similar argument leads to the estimate 

1 



(6) 



K{z) + 



< 2Dn\z\. 



□ 



Lemma 6. Suppose that (i) m„>4/y^ and (ii) > 4Z)„/f2. Then 
there are no zeros of K'^{z) inside \z\ < — r„. 

In other words, Kn{z) has no critical points in a circle which is sufficiently 
separated from z = ±l/y/u^. 
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— 1/2 

Proof of Lemma 6. If r„ > i)„ , then the set \z\ < — r„ is 

— 1/2 

empty and we are done. In the following, we assume that rn < Vn ■ On 

I I -1/2 1 I 1-2 Ai I -1/2|| , -1/2| -1/2 

\z\ = Vn — Tn, We have \z\ > Vn- Also, \Z — Vn \\Z + Vn \ > TnVn 

This is easy to see by considering the two cases Hez > and Rez < 0. In 
the first case, \z — Vn^^'^l > rn and \z + Vn^^'^l > Vn^^'^ ■ In the second case, 
\z — Vn^^'^\ > Vn^^"^ and Iz + iJn"'^^^! > Tn- Hence, in both cases, the product 

I -1/2|| , -1/2, ^ -1/2 

\Z-Vn \\Z + Vn > TnVn 



Therefore, 



-2,1 I |-2| -1/2|| I -1/2| 

-Z -]-Vn\=Vn\z\ \z - Vn ' \\z + V^ ' \ 



— 1/2 "^12 
>VnVnrnVn ' = ^'n V • 

Since rn + Vn^^"^ < 2vn^^'^, assumption (i) imphes that rn + Vn^^'^ < mn/2. 

— 1/2 

Hence, the circle = fn + lies entirely in the area where formula (4) 
applies to K!^{z). Consequently, using (4), we can estimate 

\K{z)-{-Z-^+Vn)\<4.DnVn'/\ 

where we used the fact that \z\ < 2vn^^'^- By assumption (ii), r„ > ADnV~'^, 
therefore Vn rn > 4:DnVn and Rouche's theorem is applicable to the pair 
of K!^{z) and —z~'^ + Vn- Both K'^{z) and —z~'^ + Vn have only one pole, of 
order two, in |z| < v~^/'^ — rn and the function — + Vn has no zeros inside 
\z\ < — rn- Therefore, Rouche's theorem implies that there are no zeros 
of K'^{z) inside \z\ < f^^^^ — r„. (Rouche's theorem is often formulated only 
for holomorphic functions, but as a consequence of the argument principle 
(see, e.g.. Theorems II. 2. 3 and II. 2.4 in [12]), it can be easily reformulated for 
meromorphic functions. In this form, it claims that a meromorphic function, 
f{z), has the same difference between the number of zeros and number of 
poles inside a curve 7 as another meromorphic function, g{z) , provided that 
|/(z)| > \g{z) — f{z)\. For this formulation see, e.g., [9], Theorem 9.2.3.) □ 

Condition 2. Assume, in the following, that r„ = ADn/Vn- 

We now use our knowledge about the location of critical points of Kn{z) 
to investigate how it behaves on the real interval around zero. 

0/9 

Lemma 7. Suppose that nin > 4/y^ and Dn/vn < 1/8. Then Kn{z) 
maps the set [— l/v^ + r„, 0) U (0, 1/y^ — r„] in a one-to-one fashion onto 
a set that contains the union of two intervals (—00, —2^/v^ — cDn/vn) U 
{2^Jn^-\- cDn/vmOo), where c is a constant that does not depend on n. 



Remark. For example, c = 5 will work. 
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Proof of Lemma 7. The assumption that m„ > A/ ^/v^ ensures that 
the power series for Kn{z) converges in |2;| < z ^0. Note that Kn{z) 

is real- valued on the set /= [— l/y^+ r„, 0) U (0, — r„] because this 
set belongs to the area where the series for Kn{z) converges and the coef- 
ficients of this series are real. Moreover, by Lemma 6, there are no critical 
points of Kn{z) on / [i.e., for every z e /, K'^{z) ^ 0], therefore Kn{z) must 
be strictly monotonic on subintervals [— l/y^ + r„,0) and (0, Xj ^fn^— r„]. 
Consequently, K„(/) = (-oo, K„(-l/^/i^ + r„)] U [A'„(l/^/?};r- r„), oo). We 
claim that /f„(l/^t^ - r„) < 2^f%hi + ^Dn/vn and Kn{-1 / y/v:;^ + r„) > 
-2y/n;i-bDn/vn. 

Indeed, if we write 



Kn{z) = -+ VnZ + h{z), 



then 



1 



According to our assumption, r„y^ = ADn/vn < 1/2. Therefore, we can 
estimate 



1 



and 



< 1 + 2rny/v^ 



We can estimate the last term using Lemma 5 as 



h 



1 



1 



Dn/Vn- 



Combining all of this and substituting r„ = ADn/v^, we get 

1 



Similarly, we can derive that 



< 2y/v^+5Dn/Vn- 



+ rn] > -2^/u^- 5Dn/Vn- 



□ 



From the previous lemma, we can conclude that Kn{z) has a differentiable 
inverse defined on (— oo, —2^/v^ — cDn/vn) U {2^/v^ + cDn/vn, oo). We can 
extend this conclusion to an open complex neighborhood of this interval. 
This is achieved in the next two lemmas. 
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Lemma 8. As in the previous lemma, suppose that nin > and 

0/9 

Dn/vn < 1/8. Let z be an arbitrary point of the interval [— l/y^+r^, — 
r„] . Then we can find a neighborhood Uz of z and a neighborhood of 
w = Kn{z) such that Kn is a one-to-one map of Uz onto Ww and the in- 
verse map is holomorphic everywhere in Ww 

Proof. Since the power series for Kn{z) — z~^ converges in \z\ < A/^/v^, 
the function Kn{z) is holomorphic in \z\ < z ^ 0. In addition, by 

Lemma 6, z G [—l/y/v^+rn,l/^/^hl — rn] is not a critical point of Kn{z). 
Therefore, for z ^ 0, the conclusion of the lemma follows from Theorems 
IL3.1 and IL3.2 in [12]. For z = 0, the argument is parallel to the argument 
in Markushevich, except for a different choice of local coordinates. Indeed, 
f{z) = 1/Kn{z) is holomorphic at ^; = 0, it maps z = to w = and f'{z) = 
1 7^ at z = 0. Therefore, Theorems II. 3.1 and II. 3. 2 in [12] are applicable 
to f{z) and it has a well-defined holomorphic inverse in a neighborhood of 
w = 0. This implies that Kn{z) has a well-defined holomorphic inverse in a 
neighborhood of 00, given by the formula K-^{z) = /-^(l/z). □ 

Lemma 9. Local inverse K~^{z) defined in the previous lemma is a re- 
striction of a function Gn{z) which is defined and holomorphic everywhere in 

1 /2 1/2 

a neighborhood of I = {cxd} U (—00, —2vn — cDn/vn] U [2vn + cDn/vn, 00). 
The function Gn{z) is the inverse of Kn{z) in this neighborhood. 

Proof. By Lemma 7, for every point w £ L, we can find a unique z € 
[— 1/y^ + r„,, 1/y^ — r„,] such that Kn{z) =w. Let Uz and be the 
neighborhoods defined in the previous lemma. Also, let us write {K~^ jW^) 
to denote the local inverses defined in the previous lemma together with 
their areas of definition. Our task is to prove that these local inverses can 
be joined to form an analytic function K~^, well defined everywhere in a 
neighborhood of /. We will do this in several steps. 

First, an examination of the proof of the previous lemma and Theorem 
II. 3.1 in [12] shows that we can take each Uz in the form of a disc. Let Uz = 
Uz/3, that is, define Uz as a disc that has the same center, but radius one 
third that of Uz- Define as Kn{Uz)- These new sets are more convenient 
because of the property that if Uz^ H Uz2 7^ 0, then either Uzi U Uz2 C Uzi or 
Uzi U Uz2 C Uz2 - In particular, this means that if Uz-^ n Uz2 / 0, then Kn{z) 
is a one-to-one map of f/^^ U Uz2 onto W^i U Ww2 ■ This is convenient because 
Kn is one-to-one not only on a particular neighborhood C/^i j but also on the 
union of every two intersecting neighborhoods Uz-^ and f/^j . Let us call this 
the extended invertibility property. 

Next, define an even smaller U z with the following properties: (1) U z C Uz', 

(2) Wyj =: Kn{U z) is either an open disc for z / or the set \w\ > R for 
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z = 0; and (3) ^ Ww This is easy to achieve by taking an appropriate 

open subset of as and applying K~^. Note that the property of the 
previous paragraph (i.e., extended invertibiUty) remains vahd for the new 

sets Uz ■ _ 

Discs Ww form an open cover of / and the corresponding sets Uz form an 
open cover for K~^{I), which is a closed interval contained in [— l/y^ + 
r„, r„]. Let U^, i = 0, . . . , N ,he a finite cover of K~^{I) selected from 

{U z}- [Recall that is well defined on the interval I by Lemma 7, hence 
K~^{I) is well defined.] We can find a finite cover due to the compactness of 
K~^{I). Further, let W^ =: Kn{Ui) be the corresponding cover of /, selected 

from {VFx}. For convenience, let Wq denote the set for w = oo. Finally, 
let R = Uilo Ui and S = Uilo ^i- Sets R and S are illustrated in Figure 1. 

Clearly, S is open. We aim to prove that S is simply connected in the 
extended complex plane CU {oo}. For this purpose, let us define the defor- 
mation retraction Fi of the set S as follows: (1) if z € Wq, then z — > z; (2) if 
z ^ Wq, then z — > Rez + (1 — t) Imz. Here, parameter t changes from to 1. 
(For the definition and properties of deformation retractions, see, e.g., [8]; 
the definition is on page 2 and the main property is in Proposition 1.17.) 
This retraction reduces S* to a homotopically equivalent set S' that con- 
sists of Wq and two intervals of the real axis that do not include 0. We can 
then use another deformation retraction F2 that sends z to {l — t)~^z. This 
retraction reduces S' to S" = {00}, which is evidently simply connected. 

We know that there is a holomorphic inverse K~^(z) defined on each 
of Wi. Starting from one of these domains, say Wq, we can analytically con- 
tinue K~^{z) to every other Wi. Indeed, take points zq € C/q and Zi G Ui 
and connect them by a path that lies entirely in i? = IJi^o ^« • This path 
corresponds to a chain {C/fc^}, s = 1, . . . ,n, that connects Uq and Ui. That is, 
Uki = Uq, Uk„ = Ui and Uk^ D Ukj^^ + 0. The corresponding Wk, = KniUkJ 
form a chain that connects Wq and Wj, that is, Wk^ = Wq, Wk„ = Wi and 
Wk - n Wk _^_^ / 0. By our construction, this chain of sets Wk^ has the prop- 
erty that K-\Wk^)n R-^Wk^^,) = U^^ n U^^^, ^ 0. 

Consider two adjacent sets, Wk^ and W^fc^+i , in this chain. Then the 
corresponding local inverses {K~^ ^W^^) and {K~^ ,Wkj^-^), which were de- 
fined in the previous lemma, coincide on an open nonempty set. Indeed, 
Kn{Uk^ n C/fc^.^J C ifn(f/fc,) n if„(C/fc^+J = Wk^ n Wk^^^, therefore the func- 
tions {K~^,Wkj) and {K~^,Wk^_^^) are both well defined on Kn{Uk^ D Uk^^^)- 
Moreover, they must coincide on Kn{Ukj r\Ukj_^j)- Indeed, by construction, 
Ukj n C^fc^+i / and therefore, by the extended invertibility property, Kn is 
one-to-one on Ukj U Ukj^-^ - Hence, there cannot exist two different z and z' 
in Ukj U Ukj^-^ that would map to the same point in Kn{Uk^ H Uk^^-^)- Hence, 
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{K^^,Wkj) and (K„\VVfc^._^J must coincide on Kn{Uk^ n J/fc^+J, which is 
open and nonempty. 

Using the property that if two analytical functions coincide on an open 
set, then each of them is an analytic continuation of the other, we conclude 
that the local inverse {K~^,Wkj) can be analytically continued to W^fcj+D 
where it coincides with the local inverse {K~^ ,Wkj^^)- Therefore, at least 
one analytic continuation of {Wq,K~^) is well defined everywhere on S and 
has the property that when restricted to each of Wj, it coincides with a local 
inverse of Kn{z) defined in the previous lemma. Since 5 is simply connected, 
the analytic continuation is unique, that is, it does not depend on the choice 
of the chain of the neighborhoods that connect Wq and Wj. 

Let us denote the function resulting from this analytic continuation as 
Gn{z)- By construction, it is unambiguously defined for every Wj and the re- 
strictions of Gn{z) to Wj coincide with {K~^ ,Wj). Therefore, Gn{z) satisfies 
the relations Kn{Gn{z)) = z and Gn{Kn{z)) = z everywhere on i? = IJi^o ^« 
and on 5 = UiIo^i- 

Since S is an open neighborhood of I, every claim of 
the lemma is proved. □ 

Lemma 10. The function Gn{z) constructed in the previous lemma is 
the Cauchy transform of Sn ■ 

By construction, G~^{z) is the inverse of Kn{z) in a neighborhood of 

1/2 1/2 

{oo} U (— oo,— 2?;n — cDn/vn) U {2vn + cDn/vn,oo). In particular, it is 
the inverse of Kn{z) in a neighborhood of infinity. Therefore, in this neigh- 
borhood, it has the same power expansion as the Cauchy transform of Sn- 
Therefore, it coincides with the Cauchy transform of Sn in this neighbor- 
hood. Next, we apply the principle that if two analytical functions coincide 
in an open domain, then they coincide at every point where they can be 
continued analytically. 

It remains to apply Lemma 1 in order to obtain the following theorem. 

Theorem 3. Suppose that (i) liminf > 4 and (ii) limsup„_^oo D„/ 

Vn"^ < 1/8. Then for all sufficiently large n, the support of fin belongs to 

I = {-2./v^ - cDn/Vn,2,/v^, + cDn/Vn), 

where c> is an absolute constant (e.g., c = 5). 

Proof of Theorem 3. Let us collect the facts that we know about the 
function Gn{z) defined in Lemma 9. First, by Lemma 10, it is the Cauchy 
transform of a bounded random variable, Sn- Second, by Lemma 9, it is 

1/2 

holomorphic at all z G M such that \z\> 2vn + cDn/vn- Using Lemma 1, 
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1/2 

we conclude that the distribution of Sn is supported on the interval [—2vn — 

1/2 

cDn/Vn,2Vn + cDn]. □ 

If we take Rn,i = 2Ln^i and nin^i = (ALn^i)^^, then assumption (i) is equiv- 
alent to 



which is equivalent to 



From (ii), we obtain 

1/8 > hm sup :r-p: = lim sup :r-p: 



liminf min " > 4, 



lim sup —— < 16. 



which is equivalent to 



Tn 

limsup^T^r < 1/256. 



Finally, note that the condition lim sup^^g^ T„,/fn^^ < 2"-*^^ implies that 
limsup„^oQ L„/y^ < 16. Therefore, Theorem 1 is a consequence of Theo- 
rem 3. 
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